Approaches to Microphone Independence in Automatic Speech Recognition
نویسندگان
چکیده
This paper describes a series of cepstral-based compensation procedures that render the SPHINX-II system more robust with respect to acoustical changes in the environment. The first algorithm, RATZ (MultivaRiate gAussian based cepsTral normaliZation) requires stereo-data for computing compensation terms, and is similar in philosophy to MFCDCN [ref] (in fact MFCDCN can be thought of as a discrete case of RATZ). We also describe a second algorithm, an improved version of CDCN, that does not require stereo training data and yet achieves performance levels comparable to the RATZ and other stereo algorithms. Use of the various compensation algorithms in consort produces a reduction of error rates for SPHINX-II by as much as 20.0% percent relative to the rate achieved with cepstral mean normalization alone, in both development test sets and in the context of the 1994 ARPA CSR evaluations.
منابع مشابه
Adaptation and Compensation : Approaches to Microphone and Speaker Independence in Automatic Speech Recognition
This paper describes recent efforts by the CMU speech group to address the important problems of robustness to changes in environment and speaker. Results are presented in the context of the 1995 ARPA common Hub 3 evaluation of speech recorded through different microphones at different signal-to-noise ratios (SNRs). For speech that is considered to be of high quality we addressed the problem of...
متن کاملSignal Processing for Robust Speech Recognition
This chapter compares several di erent approaches to robust automatic speech recognition. We review ongoing research in the use of acoustical pre-processing to achieve robust speech recognition, discussing and comparing approaches based on direct cepstral comparisons, on parametric models of environmental degradation, and on cepstral high-pass ltering. We also describe and compare the e ectiven...
متن کاملAutomatic Speech Recognition of Human-Symbiotic Robot EMIEW
Automatic Speech Recognition (ASR) is an essential function of robots which live in the human world. Many works for ASR have been done for a long time. As a result, computers can recognize human speech well under silent environments. However, accuracy of ASR is greatly degraded under noisy environments. Therefore, noise reduction techniques for ASR are strongly desired. Many approaches based on...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003